CLAIMS 

What is claimed is: 

1. An integrated voice activation detector for 
detecting whether voice is present, the integrated voice 
activation detector comprising: 

a semiconductor integrated circuit including, 

at least one signal processing unit to perform 
voice detection; and 

a processor readable storage means to store 
signal processing instructions for execution by the 
at least one signal processing unit to: 

detect whether noise is present to 
determine whether a noise flag should be set; 

detect a predetermined number of zero 
crossings to determine whether a zero crossing 
flag should be set; 

detect whether a threshold amount of 
energy is present to determine whether an 
energy flag should be set; 

detect whether instantaneous energy is 
present to determine whether an instantaneous 
energy flag should be set; and 

utilize a combination of the noise, 
zero crossing, energy, and instantaneous energy 
flags to determine whether voice is present. 

2. The integrated voice activation detector of 
claim 1, wherein the signal processing instructions 
further for execution by the at least one signal 
processing unit to, perform fast Fourier transformation 
(FFT) processing to determine whether an FFT flag should 
be set. 

3. The integrated voice activation detector of 
claim 1, wherein the signal processing instructions 
further for execution by the at least one signal 
processing unit to, perform an interim voice activity 
decision, an interim voice activity decision flag being 
set to indicate voice has been detected by determining if 
the instantaneous energy flag is set or the energy flag 
is set and the noise flag is not set and the zero 
crossing flag is not set. 

4. The integrated voice activation detector of 
claim 3, wherein the signal processing instructions 
further for execution by the at least one signal 
processing unit to, perform HangOver and Speech Kick in 
processing after the interim voice activity decision has 
been made to determine whether a voice activity flag 
should be set or cleared. 

5. The integrated voice activation detector of 
claim 4, wherein the signal processing instructions 
further for execution by the at least one signal 
processing unit to, if the voice activity flag is set, 
send a speech payload to be packetized and update the 
voice activity detection flag for external interaction 
with other functions of the semiconductor integrated 
circuit. 

6. The integrated voice activation detector of 
claim 4, wherein the signal processing instructions 
further for execution by the at least one signal 
processing unit to, if the voice activity flag is not 
set, disable an automatic level control and cause a 
silence insertion description payload to be prepared. 



7. The integrated voice activation detector of 
claim 1, wherein detecting a predetermined number of zero 
crossings to determine whether a zero crossing flag 
should be set includes determining whether a root mean 
square crossing value is greater than a threshold value. 


8. The integrated voice activation detector of 
claim 1, wherein detecting whether noise is present to 
determine whether a noise flag should be set includes 
determining whether energy in a current frame multiplied 
by a threshold is greater than delayed frame energy. 

9. The integrated voice activation detector of 
claim 1, wherein detecting whether a threshold amount of 
energy is present to determine whether an energy flag 
should be set includes determining if a logarithm of an 
autocorrelation of a frame is greater than an energy 
threshold. 

10. The integrated voice activation detector of 
claim 1, wherein detecting whether instantaneous energy 
is present to determine whether an instantaneous energy 
flag should be set includes determining whether a 
difference between a current frame's energy at an 
autocorrelation of a tenth delayed sample and a prior 
frame's energy at an autocorrelation of a tenth delayed 
sample is greater than a previous frame's autocorrelation 
multiplied by a threshold. 
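
By way of illustration only, the comparison recited in claim 10 might be sketched in Python as follows. The function names, the unnormalized form of the autocorrelation, and the reading of "a previous frame's autocorrelation" as the prior frame's value at the same lag of ten samples are assumptions made for this sketch and are not recited in the claim.

```python
def autocorr_at_lag(frame, lag):
    """Unnormalized autocorrelation of one frame at the given lag (assumed form)."""
    return sum(frame[n] * frame[n - lag] for n in range(lag, len(frame)))


def instantaneous_energy_flag(current_frame, previous_frame, threshold, lag=10):
    """Return True when the rise in lag-10 autocorrelation between frames
    exceeds the previous frame's autocorrelation scaled by `threshold`."""
    r_cur = autocorr_at_lag(current_frame, lag)
    # Assumption: "previous frame's autocorrelation" is taken at the same lag.
    r_prev = autocorr_at_lag(previous_frame, lag)
    return (r_cur - r_prev) > threshold * r_prev
```

For example, with a quiet previous frame and a louder current frame, the flag would be set for a small threshold and left clear for a large one.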

11. A method for voice activation detection to 
detect whether voice is present, the method comprising: 

detecting whether noise is present to 
determine whether a noise flag should be set; 

detecting a predetermined number of 
zero crossings to determine whether a zero 
crossing flag should be set; 

detecting whether a threshold amount of 
energy is present to determine whether an 
energy flag should be set; 

detecting whether instantaneous energy 
is present to determine whether an instantaneous 
energy flag should be set; and 

utilizing a combination of the noise, 
zero crossing, energy, and instantaneous energy 
flags to determine whether voice is present. 

12. The method of claim 11, further comprising, 
performing fast Fourier transformation (FFT) processing 
to determine whether an FFT flag should be set. 

13. The method of claim 11, further comprising, 
performing an interim voice activity decision, an interim 
voice activity decision flag being set to indicate that 
voice has been detected by determining if the 
instantaneous energy flag is set or the energy flag is 
set and the noise flag is not set and the zero crossing 
flag is not set. 
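
By way of illustration only, the interim decision of claim 13 might be sketched in Python as follows. The claim language does not state how the "or" and "and" conditions are grouped; this sketch assumes conventional Boolean precedence, in which the instantaneous energy flag alone is sufficient. The function and parameter names are hypothetical.

```python
def interim_voice_decision(inst_energy_flag, energy_flag, noise_flag, zero_crossing_flag):
    """Combine the four detector flags into an interim voice activity decision.

    Assumed grouping: inst_energy OR (energy AND NOT noise AND NOT zero_crossing).
    The alternative reading, (inst_energy OR energy) AND NOT noise AND NOT
    zero_crossing, would differ only when the instantaneous energy flag is set
    together with the noise flag or the zero crossing flag.
    """
    return bool(
        inst_energy_flag
        or (energy_flag and not noise_flag and not zero_crossing_flag)
    )
```

Under this assumed grouping, a frame with only the energy flag set yields a positive interim decision, while the same frame with the noise flag also set does not.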

14. The method of claim 13, further comprising, 
performing HangOver and Speech Kick in processing after 
the interim voice activity decision has been made to 
determine whether a voice activity flag should be set or 
cleared. 



15. The method of claim 14, further comprising, if 
the voice activity flag is set, sending a speech payload 
to be packetized and updating the voice activity 
detection flag for external interaction with other 
functions. 

16. The method of claim 14, further comprising, if 
the voice activity flag is not set, disabling an 
automatic level control and causing a silence insertion 
description payload to be prepared. 

17. The method of claim 11, wherein detecting a 
predetermined number of zero crossings to determine 
whether a zero crossing flag should be set includes 
determining whether a root mean square crossing value is 
greater than a threshold value. 

18. The method of claim 11, wherein detecting 
whether noise is present to determine whether a noise 
flag should be set includes determining whether energy in 
a current frame multiplied by a threshold is greater than 
delayed frame energy. 



19. The method of claim 11, wherein detecting 
whether a threshold amount of energy is present to 
determine whether an energy flag should be set includes 
determining if a logarithm of an autocorrelation of a 
frame is greater than an energy threshold. 

20. The method of claim 11, wherein detecting 
whether instantaneous energy is present to determine 
whether an instantaneous energy flag should be set 
includes determining whether a difference between a 
current frame's energy at an autocorrelation of a tenth 
delayed sample and a prior frame's energy at an 
autocorrelation of a tenth delayed sample is greater than 
a previous frame's autocorrelation multiplied by a 
threshold. 



21. An apparatus comprising: 

at least one signal processing unit to perform 
voice detection; and 

a storage device to store signal processing 
instructions for execution by the at least one 
signal processing unit to: 

determine whether a noise flag, a zero 
crossing flag, an energy flag, and an 
instantaneous energy flag should be set; and 

utilize a combination of the noise, 
zero crossing, energy, and instantaneous energy 
flags to determine whether voice is present. 

22. The apparatus of claim 21, wherein the signal 
processing instructions further for execution by the at 
least one signal processing unit to: 

detect whether noise is present to 
determine whether the noise flag should be set; 

detect a predetermined number of zero 
crossings to determine whether the zero 
crossing flag should be set; 

detect whether a threshold amount of 
energy is present to determine whether the 
energy flag should be set; and 

detect whether instantaneous energy is 
present to determine whether the instantaneous 
energy flag should be set. 



23. The apparatus of claim 21, wherein the signal 
processing instructions further for execution by the at 
least one signal processing unit to, perform fast Fourier 
transformation (FFT) processing to determine whether an 
FFT flag should be set. 

24. The apparatus of claim 21, wherein the signal 
processing instructions further for execution by the at 
least one signal processing unit to, perform an interim 
voice activity decision, an interim voice activity 
decision flag being set to indicate voice has been 
detected by determining if the instantaneous energy flag 
is set or the energy flag is set and the noise flag is 
not set and the zero crossing flag is not set. 

25. The apparatus of claim 24, wherein the signal 
processing instructions further for execution by the at 
least one signal processing unit to, perform HangOver and 
Speech Kick in processing after the interim voice 
activity decision has been made to determine whether a 
voice activity flag should be set or cleared. 





26. The apparatus of claim 25, wherein the signal 
processing instructions further for execution by the at 
least one signal processing unit to, if the voice 
activity flag is set, send a speech payload to be 
packetized and update the voice activity detection flag 
for external interaction with other functions of the 
semiconductor integrated circuit. 



27. The apparatus of claim 25, wherein the signal 
processing instructions further for execution by the at 
least one signal processing unit to, if the voice 
activity flag is not set, disable an automatic level 
control and cause a silence insertion description payload 
to be prepared. 

28. The apparatus of claim 22, wherein detecting a 
predetermined number of zero crossings to determine 
whether a zero crossing flag should be set includes 
determining whether a root mean square crossing value is 
greater than a threshold value. 



29. The apparatus of claim 22, wherein detecting 
whether noise is present to determine whether a noise 
flag should be set includes determining whether energy in 
a current frame multiplied by a threshold is greater than 
delayed frame energy. 

30. The apparatus of claim 22, wherein detecting 
whether a threshold amount of energy is present to 
determine whether an energy flag should be set includes 
determining if a logarithm of an autocorrelation of a 
frame is greater than an energy threshold. 

31. The apparatus of claim 22, wherein detecting 
whether instantaneous energy is present to determine 
whether an instantaneous energy flag should be set 
includes determining whether a difference between a 
current frame's energy at an autocorrelation of a tenth 
delayed sample and a prior frame's energy at an 
autocorrelation of a tenth delayed sample is greater than 
a previous frame's autocorrelation multiplied by a 
threshold. 

042390. P12534 

Express Mail; EL802874811US 



32. A method comprising: 

determining whether a noise flag, a zero 
crossing flag, an energy flag, and an 
instantaneous energy flag should be set; and 

utilizing a combination of the noise, zero 
crossing, energy, and instantaneous energy 
flags to determine whether voice is present. 

33. The method of claim 32, further comprising: 

detecting whether noise is present to 
determine whether the noise flag should be set; 

detecting a predetermined number of zero 
crossings to determine whether the zero 
crossing flag should be set; 

detecting whether a threshold amount of 
energy is present to determine whether the 
energy flag should be set; and 

detecting whether instantaneous energy is 
present to determine whether the instantaneous 
energy flag should be set. 

34. The method of claim 33, further comprising, 
performing fast Fourier transformation (FFT) processing 
to determine whether an FFT flag should be set. 

35. The method of claim 32, further comprising, 
performing an interim voice activity decision, an interim 
voice activity decision flag being set to indicate that 
voice has been detected by determining if the 
instantaneous energy flag is set or the energy flag is 
set and the noise flag is not set and the zero crossing 
flag is not set. 

36. The method of claim 35, further comprising, 
performing HangOver and Speech Kick in processing after 
the interim voice activity decision has been made to 
determine whether a voice activity flag should be set or 
cleared. 

37. The method of claim 36, further comprising, if 
the voice activity flag is set, sending a speech payload 
to be packetized and updating the voice activity 
detection flag for external interaction with other 
functions. 

38. The method of claim 36, further comprising, if 
the voice activity flag is not set, disabling an 
automatic level control and causing a silence insertion 
description payload to be prepared. 

39. The method of claim 33, wherein detecting a 
predetermined number of zero crossings to determine 
whether a zero crossing flag should be set includes 
determining whether a root mean square crossing value is 
greater than a threshold value. 
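
By way of illustration only, the crossing test of claim 39 might be sketched in Python as follows. The claim does not define "root mean square crossing value"; this sketch assumes it can be read as a count of crossings of a reference level, where the level is either zero or the frame's root mean square amplitude. All names and the `use_rms_level` switch are hypothetical.

```python
import math


def rms(frame):
    """Root mean square amplitude of one frame."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def count_level_crossings(frame, level=0.0):
    """Count consecutive sample pairs that lie strictly on opposite sides of `level`."""
    crossings = 0
    for prev, cur in zip(frame, frame[1:]):
        if (prev - level) * (cur - level) < 0:
            crossings += 1
    return crossings


def zero_crossing_flag(frame, count_threshold, use_rms_level=False):
    """Return True when the crossing count exceeds the predetermined number."""
    # Assumption: the reference level is 0 for plain zero crossings,
    # or the frame RMS for one possible reading of "RMS crossing value".
    level = rms(frame) if use_rms_level else 0.0
    return count_level_crossings(frame, level) > count_threshold
```

A rapidly alternating frame such as `[1, -1, 1, -1, 1]` has four zero crossings, so the flag would be set for a predetermined number of three and left clear for four.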

40. The method of claim 33, wherein detecting 
whether noise is present to determine whether a noise 
flag should be set includes determining whether energy in 
a current frame multiplied by a threshold is greater than 
delayed frame energy. 
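
By way of illustration only, the noise test of claim 40 might be sketched in Python as follows. Frame energy is assumed here to be the sum of squared samples, and the number of frames by which the "delayed" frame lags the current frame is left to the caller; neither detail is recited in the claim, and the names are hypothetical.

```python
def frame_energy(frame):
    """Energy of one frame, assumed to be the sum of squared samples."""
    return sum(x * x for x in frame)


def noise_flag(current_frame, delayed_frame, threshold):
    """Return True when the current frame energy, scaled by `threshold`,
    exceeds the energy of the delayed (earlier) frame."""
    return frame_energy(current_frame) * threshold > frame_energy(delayed_frame)
```

For instance, a current frame of energy 2 compared against a delayed frame of energy 8 would set the flag for a threshold of 5 but not for a threshold of 2.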

41. The method of claim 33, wherein detecting 
whether a threshold amount of energy is present to 
determine whether an energy flag should be set includes 
determining if a logarithm of an autocorrelation of a 
frame is greater than an energy threshold. 
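
By way of illustration only, the energy test of claim 41 might be sketched in Python as follows. The claim does not state the autocorrelation lag or the base of the logarithm; this sketch assumes the zero-lag autocorrelation (the sum of squared samples) and a base-10 logarithm, with a small hypothetical `floor` value guarding against the logarithm of zero for an all-silent frame.

```python
import math


def energy_flag(frame, energy_threshold, floor=1e-12):
    """Return True when log10 of the zero-lag autocorrelation exceeds the threshold."""
    # Assumption: autocorrelation at lag 0, i.e. the frame's total energy.
    r0 = sum(x * x for x in frame)
    # `floor` keeps the logarithm defined when every sample is zero.
    return math.log10(max(r0, floor)) > energy_threshold
```

A frame of ten samples of amplitude 10 has a zero-lag autocorrelation of 1000, so the flag would be set for a threshold of 2.5 and left clear for a threshold of 3.5.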

42. The method of claim 33, wherein detecting 
whether instantaneous energy is present to determine 
whether an instantaneous energy flag should be set 
includes determining whether a difference between a 
current frame's energy at an autocorrelation of a tenth 
delayed sample and a prior frame's energy at an 
autocorrelation of a tenth delayed sample is greater than 
a previous frame's autocorrelation multiplied by a 
threshold. 

43. A machine-readable medium having stored thereon 
instructions, which when executed by a machine, cause 
the machine to perform operations comprising: 

determining whether a noise flag, a zero 
crossing flag, an energy flag, and an 
instantaneous energy flag should be set; and 

utilizing a combination of the noise, zero 
crossing, energy, and instantaneous energy 
flags to determine whether voice is present. 

44. The machine-readable medium of claim 43, further 
comprising: 

detecting whether noise is present to 
determine whether the noise flag should be set; 

detecting a predetermined number of zero 
crossings to determine whether the zero 
crossing flag should be set; 

detecting whether a threshold amount of 
energy is present to determine whether the 
energy flag should be set; and 

detecting whether instantaneous energy is 
present to determine whether the instantaneous 
energy flag should be set. 

45. The machine-readable medium of claim 43, further 
comprising, performing fast Fourier transformation (FFT) 
processing to determine whether an FFT flag should be set. 

46. The machine-readable medium of claim 43, further 
comprising, performing an interim voice activity 
decision, an interim voice activity decision flag being 
set to indicate that voice has been detected by 
determining if the instantaneous energy flag is set or 
the energy flag is set and the noise flag is not set and 
the zero crossing flag is not set. 



47. The machine-readable medium of claim 46, further 
comprising, performing HangOver and Speech Kick in 
processing after the interim voice activity decision has 
been made to determine whether a voice activity flag 
should be set or cleared. 

48. The machine-readable medium of claim 47, further 
comprising, if the voice activity flag is set, sending a 
speech payload to be packetized and updating the voice 
activity detection flag for external interaction with 
other functions. 

49. The machine-readable medium of claim 47, further 
comprising, if the voice activity flag is not set, 
disabling an automatic level control and causing a 
silence insertion description payload to be prepared. 

50. The machine-readable medium of claim 44, wherein 
detecting a predetermined number of zero crossings to 
determine whether a zero crossing flag should be set 
includes determining whether a root mean square crossing 
value is greater than a threshold value. 

51. The machine-readable medium of claim 44, wherein 
detecting whether noise is present to determine whether a 
noise flag should be set includes determining whether 
energy in a current frame multiplied by a threshold is 
greater than delayed frame energy. 

52. The machine-readable medium of claim 44, wherein 
detecting whether a threshold amount of energy is present 
to determine whether an energy flag should be set 
includes determining if a logarithm of an autocorrelation 
of a frame is greater than an energy threshold. 

53. The machine-readable medium of claim 44, wherein 
detecting whether instantaneous energy is present to 
determine whether an instantaneous energy flag should be 
set includes determining whether a difference between a 
current frame's energy at an autocorrelation of a tenth 
delayed sample and a prior frame's energy at an 
autocorrelation of a tenth delayed sample is greater than 
a previous frame's autocorrelation multiplied by a 
threshold. 


